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Abstract 

We describe a simple algorithm based on a Markov chain process to generate 
simply connected acyclic directed graphs over a fixed set of vertices. This algo- 
rithm is an extension of a previous one, designed to generate acyclic digraphs, non 
necessarily connected. 
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1 Introduction 

Combinatorial objects can sometimes be investigated through random generation 
in order to discover various properties, estimate statistics or test conjectures. Also, 
objects might be generated randomly to test the behaviour or performance of al- 
gorithms, with the hope that all possible or interesting cases will be obtained from 
the generation process. For example, graph drawing algorithms can be tested or 
studied by running them over a large set of examples. This approach is widely used 
and many authors provide test sets on the web (see E] for instance) . 

Graphs are widely used by non-mathematicians as a modelling tool in various 
fields (social sciences, computer science and biology to name just a few). A com- 
mon approach is to develop a theoretical but specific framework based on graphs 
and to prove its relevancy and usefulness by experimentally testing it on many ex- 
amples. However, the lack of "real-world" example sets motivate the construction 
of artificial ones. Randomly generating graphs appears as a solution, provided that 
the generating process can be somewhat controlled or kept within a given class of 
graphs. 

Acyclic digraphs (directed graphs containing no circuits) prove to be appropriate 
in many situations. The absence of circuits can indeed have different interpretations 
(logical ordering of tasks or processes, ordering induced from time, etc.). Many 
authors have studied their properties providing insight on the global and average 
shape of acyclic digraphs [01 E3 121 El El • An algorithm generating acyclic digraphs 
uniformly at random has only been published recently 7 j. However, the authors did 
not provide any specific conditions showing how to use their algorithms to generate 
simply connected acyclic digraphs. This condition is often seen as being central in 
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many applications and motivated our work. As we shall see, a careful investigation 
of the Markov chain introduced in [Jj reveals how and why the same algorithm 
can be slightly modified and used to generate simply connected acyclic digraphs 
uniformly at random. 

This note is organized as follows. We first recall basic results and notations and 
describe the Markov chain introduced in :;7]. We then discuss the conditions under 
which it can be adapted as to provide an algorithm for generating simply connected 
acyclic digraphs. The last part of the note is devoted to the proof of our result. 

2 A Markov chain algorithm 

Let N > 2 be a fixed integer and let V = {1, ... , N} denote a finite set of vertices. 
We consider the set A of all acyclic directed graphs over V, that is, graphs containing 
no circuits 1 . Next, we define the Markov chain M over the set A. Each state of 
M is an acyclic digraph in A. Because the set V of vertices is fixed, we will not 
distinguish between a digraph in A and the set of its arcs. The transition between 
any two states in M is given as follows. Let Xt denote the state of the Markov chain 
at time t. Suppose a couple of integers has been drawn uniformly at random 
from the set V x V. 

(Ti) If is an arc in Xt, it is deleted from Xt- That is, Xt+i = Xt\ 
(T2) If is not an arc in Xt, then 

(i) it is added to Xt, provided that the resulting graph is acyclic. That is, 
X t+1 = X t U{i,j). 

(ii) otherwise, nothing is done. That is, Xt+i = Xt. 

The probability of a transition going from a state X to a state Y 7^ X is either 
or 1/N 2 . Note that the transition rules make the transition matrix A symmetric. 
This is a key property in proving that the Markov chain converges towards the 
uniform distribution. More precisely, we have: 

lim A n ■ d = 1, (1) 

n^oo 

where d is any initial distribution over A and 1 stands for the normalized vector 
(j-^l , . . . , The other key element in the proof is that the state space of M 

is irreducible. That is, given any two acyclic digraphs G\ and G2, there exists a 
sequence of transitions going from G\ to G<i- This is straightforward since, starting 
from G\, one can travel by rule (Ti) to the graph with no arc (which is obviously 
acyclic) then by rule (T2) to G2. 

This result can be converted into an algorithm for randomly generating acyclic 
digraphs, each having the same probability of being drawn. Indeed, condition (T2) 
relies on the ability to test whether the addition of would create a circuit, 
which can be easily done pQ. Hence, starting from a graph with an empty set of 
arcs one can iteratively apply the rules (Ti) and (T2) to build an acyclic digraph. 
Applying those rules a large (but finite) number of times leads to an algorithm, 
which randomly outputs acyclic digraphs (with an almost uniform distribution). 

1 Recall that a circuit in a directed graph G is a sequence of vertices vq, v\, . . . , Vk — vq where each 
(vi, Vi+x), i = . . . k — 1, is an arc in G. 
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3 Customizing the algorithm 



In Theorem 1], the authors prove that random acyclic digraphs have iV 2 /4 arcs 
on average. However, in many applications, the total number of edges in graphs is 
proportional to their number of vertices 2 . Hopefully, the chain can be adapted to 
these situations by imposing further conditions on the transition (T2). For instance, 
one can allow the insertion of a new arc provided that, additionally to condition 
(T2), the total number of arcs in the graph does not exceed a given upper bound. 
The condition can also be refined in order to impose an upper bound on each vertex 
individually. 

Two observations must be made here. First, it is important to mention that ad- 
ditional conditions imposed on transitions (T\) or (T2) actually induce a restriction 
on the state space on which they act. The corresponding transition matrix may not 
be symmetric any more, so that other sufficient properties have to be established. 
Also, the irreducibility of this space has to be established before we can conclude 
that Eq. Q holds for the restricted transition matrix. However, the situations con- 
sidered in [7j are rather simple and in each case the irreducibility of the restricted 
space is straightforward. 

The contribution of this short note is to show how the Markov chain can be 
adapted to produce simply connected acyclic digraphs 3 . In the sequel, we will refer 
to simply connected graph as connected graph, for sake of brevity. 

In many applications, this appears to be an essential property in the modelling 
phase. Indeed, social networks are rarely made of several disconnected components, 
since it is exactly the interactions between the actors of the network that moti- 
vates their study. Also, disconnected acyclic digraphs are irrelevant in modelling 
task synchronization problems, since tasks belonging to distinct components can 
be performed independently. As a further example, chronology forces acyclicity in 
Bayesian networks. 

An obvious approach one can adopt in order to adapt the Markov chain M to 
generate connected acyclic digraphs is to impose on the rule (Ti) an additional con- 
dition: the deletion of an arc should only be allowed if the arc is not disconnecting, 
that is, if the resulting graph is still connected. Furthermore, a disconnecting arc 
belongs to no (undirected) cycle, so that it may be reversed without creating a cir- 
cuit. This seemingly superfluous rule has two advantages at least: it enables further 
transitions, slightly shortening the diameter of the transition graph, and it makes 
the chain irreducible for N = 2. Observe that the modified conditions preserve the 
symmetry of the transition matrix, admitted that the state space underlying this 
restricted Markov chain is now the set of all connected acyclic digraphs. So, for 
instance, the graph with no arc does not belong to the state space anymore. The 
minimal number of arcs is now N — 1. Each of these connected digraphs with N — 1 
arcs can actually be built by constructing a (non directed) tree and by assigning 
appropriate orientations to its edges. 

For sake of clarity, we denote by A' the set of all connected acyclic digraphs and 
modify the transition rules as follows. Take uniformly at random (i,j) in V 2 . 

(T-[) If (i,j) is an arc in Xt then 

When generating graphs to test drawing algorithms, for instance, it makes no sense to generate 
graphs with too many edges. Graphs having AN edges are actually considered as dense in the Graph 
Drawing community. 

3 Recall that a directed graph G is simply connected if its underlying non directed graph is connected. 
That is, if for any two vertices u, v there exists a sequence of vertices u = vq, vi, . . . , v p — v such that 
either i>;+i) or i>j) is an arc in G (i = 0. . .p — 1). 
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(i) if it is not disconnecting, it is deleted from Xt- That is, Xf+\ = Xt \ 

(ii) if it is disconnecting, it is reversed. That is, X t +\ = X t \ (i,j) U (J, i). 
(T2) If is not an arc in Xt, then 

(i) it is added to X t , provided that the resulting graph is acyclic. That is, 
X t+1 = X t U(i,j). 

(ii) otherwise, nothing is done. That is, X t+ \ = X t . 

Write M' for this Markov chain running under the rules (T-Q and (T2). Since the 
transition matrix is symmetric, the problem we face is to show that the state space 
A 1 underlying M' is irreducible. Observe that, contrarily to what was done in 
this is not straightforward. Indeed, it is not immediate to see that given two 
connected acyclic digraphs G\ and G2 there exists a sequence of transitions from 
G\ to G2 and going through connected acyclic digraphs only. The proof is given in 
the next section. 

In [2j the authors mention an algorithm by Alon and Rodeh to test for circuits 
in directed graphs which can be used for conditions (I2). Any algorithm testing a 
directed graph for simple connectedness can be used for condition (T[). This can 
be performed in a time proportional to the number of arcs in the graph (using a 
depth first search for instance, see, e.g., the textbook by Jungnickel 

The complexity of the algorithm is dominated by the total number of iterations. 
The problem of estimating a good upper bound of this number is a difficult one 
and is at the centre of the theory on Markov chains . As we will see in the next 
section, the transition graph is compact, with diameter in the order of N 2 . 

4 Irreducibility of the Markov chain 

We shall now give the complete proof of the irreducibility of the Markov chain M'. 
It is worthwhile to point out that the result holds, with the same proof, if A' is 
replaced by any subclass containing all digraphs on V with N — 1 and N arcs. 

Theorem 1 Let M' be the Markov chain defined over the space A' of all connected 
acyclic digraphs, together with the transitions defined by the rules (T{) and (T2). 
The chain M' is irreducible, that is, given two connected acyclic digraphs G and 
H, there exists in M' a sequence of transitions G = Go — > G\, G\ — > G2, 
Gp-i — > G p = H (p > 1) with all Gi E A' (i = 0. . .p). Moreover, such a sequence 
exists with length at most (N + 7)(N — 3/2). 

Proof of TheoremUl 

Since transitions in M' are reversible, it suffices to show that there exists a sequence 
of transitions from any given connected acyclic digraph G to a fixed element in A'. 
We proceed in three steps. Let H be any acyclic digraph over V, denote by H the 
undirected graph obtained from H by ignoring the orientations of its arcs. 

Step 1 There exists in M' a sequence of transitions from G to a directed tree T 
(i.e., an acyclic digraph such that T contains no cycle). Indeed, T can be obtained 
by computing a spanning tree for G. The transition is then defined by a sequence 
of arc deletions by rule (T[), leaving only the arcs present in the spanning tree. 

Step 2 There exists in M' a sequence of transitions from T to a directed chain C 
(i.e., a directed tree over V such that C is a chain; for short, a dichain). We proceed 
by diminishing the number £(T) of the leaves in T. Suppose £(T) > 2. Starting 
from a leaf u, one can find in T a simple path u = u\, U2, ■ ■ ■ , u p = w such that Ui 
has degree 2 for all % = 2, . . . ,p — 1 and w has degree at least 3. Let v 7^ U2 be 
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another neighbour of w (see the left-hand part of Figure ^ where the dashed lines 
stand for one or more edges and the dotted ones for possible edges). By rule (T2), 




Figure 1: Turning the leaf u into a vertex of degree 2 

either the arc (v, u) or (it, v) can be added to T without creating a circuit, after 
what the arc between w and v cannot be disconnecting. Deleting it by rule (T[), 
we get a directed tree T' such that £(T') = £(T) — 1, because w has still degree at 
least 2 and u is not a leaf anymore. The result follows by iterating until £(T) = 2. 

Step 3 There exists in M' a sequence of transitions from C to any Hamiltonian 
directed chain, say + 1),1 < i < N}. By rule (T{) indeed, this holds for 
N = 2 trivially. Set N > 3. By rule (T[), it suffices to show that there is in M' 
a sequence of transitions from C to one of the dichains associated with the chain 
{{i, % + 1},1 < i < N}. Let C = {{vi, 1 < i < N}, we first bring C into a 

dichain C such that C' = {{v' i} v' i+1 }, 1 < i < N} and v[ = 1. If there is j > 1 such 
that Vj = 1, an arc between vjy and t>i may be added to C by rule (T2), and the arc 
in C relative to the edge {vj-\,Vj} may then be deleted by rule (T{) (see Figure|2l 
where orientations of the arcs have been omitted for sake of simplicity, and vertices 
joined by dotted paths are possibly identical). Now v[ = 1 in C', let k be the least 




1 V N — V! Vj 



Figure 2: Turning the vertex 1 into a leaf 

integer such that v' k ^ k. Suppose k < N, then the (simple) path in C' from 1 to 
k is longer than the one from 1 to v' k . We next bring C' to a dichain C" such that 
C" = {{v", v" + i}, 1 < i < N} with v'l = i for 1 < % < k. Since its justification 
is quite similar to the previous one, we only describe the construction in Figure 
Iterating it until k = N completes the proof of the third step. 



Figure 3: Placing the vertex k (orientations are omitted). 

Finally, let G\ and G2 be in A'. Then Gi (i = 1, 2) may be turned into a directed 
tree Tj by N(N — l)/2 — (N — 1) transitions at most, the worst case obtaining when 
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Gi is a tournament. Then Tj may be turned into a dichain d by 2((iV — 1) — 2) 
transitions at most, the worst case obtaining when Ti is a star. Finally, C\ may be 
made a Hamiltonian dichain by (N — l)/2 arc reversions at most, and Ci may be 
turned into the latter Hamiltonian dichain by 2 + 4(N — 2) transitions at most. 

Remark For N > 3, the reversion rule T[(ii) is not indispensable for Theorem ^ to 
hold. It is not clear whether it fastens convergence of the process or not. 
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